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Abstract. For a family X of fc-subsets of the set {1, . . . ,n}, let \X\ be the cardi- 
nality of X and let r(X,ii) be the expected maximum weight of a subset from X 
when the weights of 1, . . . , n are chosen independently at random from a symmetric 
probability distribution fi on R. We consider the inverse isoperimetric problem of 
finding fi for which T{X,fi) gives the best estimate of ln|Ar|. We prove that the 
optimal choice of fi is the logistic distribution, in which case T{X,fi) provides an 
asymptotically tight estimate of In \X\ as k^^ In grows. Since in many important 
cases T{X,fi) can be easily computed, we obtain computationally efficient approxi- 
mation algorithms for a variety of counting problems. Given fi, we describe families 
X of a given cardinality with the minimum value of T{X,fi), thus extending and 
sharpening various isoperimetric inequalities in the Boolean cube. 



1. Introduction 

Let X be a family of /c-subsets of the set {!,... ,n}. Geometrically, we think of 
X as a set of points a; = (^i, . . . , ^n) in the Hamming sphere of radius k 

^i + ... + ^n = k where ^^€{0,1} for i = l,...,n. 

We also consider general families X of subsets of {!,... , n}, which we view as sets 
X C {0, 1}" of points in the Boolean cube. 

Let us fix a Borel probability measure fi in M. We require fj. to be symmetric, 
that is, n{A) — ij,[—A) for any Borel set A C M, and to have finite variance. 

In this paper, we relate two quantities associated with X. The first quantity is 
the cardinality \X\ of X. The second quantity r(X, //) is defined as follows. Let 



1991 Mathematics Subject Classification. 05A16, 60C05, 60D05, 51F99, 68W20. 

Key words and phrases, combinatorial counting, Hamming metric, exponential measure, lo- 
gistic distribution, polynomial time algorithms, isoperimetric inequalities. Boolean cube, inverse 
problems, stochastic processes. 

The research of the first author was partially supported by NSF Grant DMS 9734138. The 
research of the second author was partially supported by ISF Grant 039-7165. 



1 



Typeset by Am^-TW- 



us fix a measure fx as above and let 71, . . . ,7n be independent random variables 
having the distribution fx. Then 



r(^,/x) = Emax 

a;€A ' ' 

In words: we sample weights of 1, . . . ,n independently at random from the dis- 
tribution fj,, define the weight of a subset a; e X as the sum of the weights of its 
elements and let T{X,fj,) be the expected maximum weight of a subset from X. 

Often, when the choice of fi is clear from the context or not important, we write 
simply T{X). 

It is easy to see that T{X) is well defined, that T{X) — if X consists of a 
single point (recall that jj, is symmetric) and that T{X) > T{Y) provided Y C X. 
Thus, in a sense, T{X) measures how large X is. In some respects, T{X) behaves 
rather like ln|X|. For example, if X C {0,1}" and Y C {0,1}"^, we can define 
the direct product X xY G {0,1}™'+"'. In this case, |X x F| = \X\ ■ \Y\ and 
T{X X Y) ^T{X)+T{Y). 

Our goal can be stated (somewhat vaguely) as follows: 

(1.1) Problem. Find a measure fj, for which r(X, //) gives the best estimate of 
ln|X|. 

Our motivation comes from problems of efficient combinatorial counting. For 
many interesting families X, given a set 71, . . . , 7^ of weights, we can easily find 
the maximum weight of a subset x E X using well-known optimization algorithms. 
The value of r{X, jj) can be efficiently computed through averaging of several sam- 
ple maxima for randomly chosen weights 71, . . . , 7„. At the same time, counting 
elements in X can be a hard and interesting problem. Thus, for such families, 
r(X, //) provides a quick estimate for ln|X|. We give some examples in Section 2, 
where we also argue that the problems of optimization (computing r(X, //)) and 
counting (computing ln|X|) are asymptotically equivalent. 

(1.2) The logistic measure. Let X be a non-empty family of fc-subsets of 
{1, . . . , n}. One of our main results is that there are measures ^ for which r(X, jj) 
gives an asymptotically tight estimate for ln|X| provided ln|X| grows faster than 
a linear function of k. We obtain the best estimates when = //q is the logistic 
measure with density 

^ for 7 e R. 

+ e-"^ + 2 

We prove that for any a > 1 there exists j3 — (3{a) > such that 
/3T(X) < \n\X\ < T{X) provided \X\ > a'' 

and 

P{a) — > 1 as CK — )• +00. 
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Moreover, we prove that ioi t = k ^T{X) we have 

< k~^\ii\X\ <t 

for all sufficiently large t. Note that the bounds do not depend on n at all. 

The existence of such measures /i seems to contradict some basic geometric 
intuition. If we fix the cardinality |X| of a set X in the Hamming sphere, we would 
expect T{X) to be large if X is "random" and small if X is tightly packed. It 
turns out, however, that there are measures that manage to ignore, to some extent, 
the difference between dense and sparse sets. In Sections 4 and 5 we prove some 
general asymptotically tight bounds which allow one to obtain similar estimates 
for a variety of measures n. For example, the measure with density \'-f\e~^"'^ /2 also 
guarantees the asymptotic equivalence of In |X| and r(X). 

We prove that the logistic distribution is, in a well-defined sense, optimal among 
all distributions /j, for which In |X| < T{X, ji) for all non-empty X C {0, 1}": given 
a lower bound for r(X, we get the best lower bound for In |X| when ji is the 
logistic distribution. 

In addition, we prove that the logistic distribution has an interesting extremal 
property: the inequality In \X\ < V{X) turns into equality if X is a face (subcube) 
of the Boolean cube {0, 1}"^. 

We state our results in Section 3. 

The problems we are dealing with have obvious connections to some central ques- 
tions in probability and combinatorics, such as discrete isoperimetric inequalities 
(cf. [ABS98], [Le91], and [T95]) and estimates of the supremum of a stochastic 
process, see [T94]. In particular, in [T94], M. Talagrand considers the functional 
r(X, //), where X is a family of subsets of the set {1, ... , n} and ^ is the symmetric 
exponential distribution with density e~^^\/2. He proves that ln|X| < cT[X) for 
some absolute constant c, see also [La97]. In Section 7, we prove that the optimal 
value of the constant is c = 2 In 2 (the equality is obtained when X is a face of the 
Boolean cube {0, 1}"^). We also prove that ln|X| < r(X) + A;ln2 provided X lies 
in the Hamming ball of radius k (the inequality is asymptotically sharp). 

(1.3) Isoperimetric inequalities. Suppose that ji is the Bernoulli measure: 

//{1}=M-1} = ^. 

This case was studied in our paper [BSOl]. It turns out that r(X) has a simple 
geometric interpretation: the value of 0.5n — r(X) is the average Hamming distance 
from a point x in the Boolean cube {0, 1}"^ to the subset X C {0, 1}". The classical 
isoperimetric inequality in the Boolean cube. Harper's Theorem (sec [Lc91]), implies 
that among all sets X of a given cardinality, the smallest value of T{X) is attained 
when X is the sphere in the Hamming metric. More precisely, let us fix < o: < In 2. 
Then there exists (3 = f3{a)^ < /5 < 1/2, such that if Yn is the Hamming sphere 
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of radius j3n + o(n) in {0, !}"■ then we have ln|l^| = an + o(n) and for any set 
C {0,1}" with ln|X^| = an + o{n), we have T{Yn) < r(X„) + o{n). We 
determine /3 from the equation 

/*lni + (l-/J)ln^ = a 

and note that T{Yn) = l3n + o(n). 

In Section 8, we construct sets F„ with asymptoticaUy the smaUest value of 
T{Yn) for an arbitrary symmetric probabihty measure ^ with finite variance. It is 
no longer true that is a Hamming sphere in {0, 1}"'. For example, if /^{l} = 

— 1} = /u{0} = 1/3 then Y^ has to be the direct product of two Hamming spheres. 
It turns out that for any symmetric ^ with finite variance Y^ can be chosen to be 
the direct product of at most two Hamming spheres. More precisely, let us fix a 
symmetric probability measure and a number < o; < ln2. Then we construct 
numbers Aj = Aj(Q;,/i) > and < /9i = f3i{a^jji) < 1/2 for z = 1,2, such that 
Ai + A2 = 1 and the following holds: if Y^ is the direct product of the Hamming 
sphere of radius /9in + o{n) in the Boolean cube of dimensions Ain + o(n) and the 
Hamming sphere of radius (32n + o{n) in the Boolean cube of dimension A2n + o(n) 
(so that Yn is a subset of the Boolean cube of dimension n) then In jy^l = an + o{n) 
and T{Yn) < T{Xn) + o{n) for any set X„ C {0, 1}"' such that In = an + o{n). 

(1.4) The inverse isoperimetric problem. It turns out that for the inequality 
In |X| < cT{X, n) to hold with some constant c = c(//) for any non-empty family X 
of /c-subsets of {1, . . . ,72}, the measure ^ has to have an at least exponential tail, 
that is, for the cumulative distribution function F of iJ,we must have 1 — F(t) > e~"* 
for some a > and all sufficiently large t. On the other hand, for the lower bound 
of In \X\ in terms of k~^T{X, ji) to be non-trivial (other than 0), ji has to have 
an at most exponential tail. Thus for the solution of Problem 1.1, which we call 
the inverse isoperimetric problem, we are interested in measures with exponential 
tails. 

2. Applications to Combinatorial Counting 

This research is a continuation of [B97] and [BSOl], where the idea to use opti- 
mization algorithms for counting problems was developed. 

First, we discuss how to compute T{X) for many interesting families of subsets. 

Let us assume that the family X of subsets of {1, . . . , n} is given by its Opti- 
mization Oracle. 

(2.1) Optimization Oracle 

Input: Real vector c~ (71, . . . , 7^) 
Output: Real number 

w{X, c) = max 7^. 
x&x ' 
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Thus, we input real weights of the elements 1, . . . ,n and output the maximum 
weight w{X,c) of a subset x G X in this weighting. As is discussed in [B97] 
and [BSOl], for many interesting families X Optimization Oracle 2.1 can be easily 
constructed. We provide two examples below. 

(2.2) Bases in matroids. Let Ahe a k x n matrix of rank k over a field F. We 
assume that k < n. Let X — X{A) be the set of all fc-subsets a; of {1, . . . , n} such 
that the columns of A indexed by the elements of x are linearly independent. Thus 
X is the set of all non-zero k x k minors of A, or, in other words, the set of bases 
of the matroid represented by A. It is an interesting and apparently hard problem 
to compute or to approximate the cardinality of X , cf. [JS97]. 

On the other hand, it is very easy to construct the Optimization Oracle for 
X. Indeed, given real weights 7i, . . . ,7n, we construct a linearly independent set 
a^i,... ,aii^ of columns of the largest total weight one- by-one. First, we choose 
tti^ to be a non-zero column of A with the largest possible weight 71^. Then we 
choose tti^ to be a column of the maximum possible weight such that a^^ and 

(1^2 are 

linearly independent. We proceed as above, and finally select Oj^, to be a column of 
the maximum possible weight such that 0^^, . . . , a^j. are linearly independent; cf., 
for example. Chapter 12 of [PS98] for "greedy algorithms". Particular cases of this 
problem include counting forests and spanning subgraphs in a given graph. 

Let A and B he k x n matrices of rank k < n and let X be the set of all 
/c-subsets X of {1, . . . ,n} such that the columns of A indexed by the elements of 
X are linearly independent and the columns of B indexed by the elements of x 
are linearly independent. Then there exists a much more complicated than above, 
but still polynomial time algorithm, which, given weights 71, . . . ,7n, computes the 
largest weight of a subset x from X, see Chapter 12 of [PS98]. 

(2.3) Perfect matchings in graphs. Let G be a graph with 2k vertices and n 
edges. A collection of k pairwise disjoint edges in G is called a perfect matching 
(known to physicists as a dimer cover). It is a hard and interesting problem to 
count perfect matchings in a given graph, see [JS97]. Recently, using the Markov 
chain approach, M. Jerrum, A. Sinclair and E. Vigoda constructed a polynomial 
time approximation algorithm to count perfect matchings in a given bipartite graph 
[JSVOl], but for general graphs no such algorithms are known. 

There is a classical 0{n^) algorithm for finding a perfect matching of the maxi- 
mum weight in any given edge- weighted graph, see Section 11.3 of [PS98], so Oracle 
2.1 is readily available. 

For any set X given by its Optimization Oracle 2.1, the value of T{X) can be 
well approximated by the sample mean of a moderate size. 

(2.4) Algorithm for computing r(X, //) 



Input: A family X of subsets of {1, . . . ,n} given by its Optimization Oracle 2.1; 
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Output: A number w approximating T{X,fj,); 

Algorithm: Choose a positive integer m (see Section 2.5 for details). Sample 
independently m random vectors Ci from the product measure in R"^. For each 
vector Ci, using Optimization Oracle 2.1, compute the maximum weight w{X,Ci) 
of a subset from X. Output 

w = — y^w{x,ci). 

1=1 

(2.5) Choosing the number of samples m. Let us consider the output 

w = w{X; Ci, . . . ,Cm) 
of Algorithm 2.4 as a random variable on the space 

^ V ' 

m times 

endowed with the product measure Clearly, the expectation of w is r(X, //). 

Let D — E(7^) be the variance of jjL. Using the estimates 

(E^O -(El^^l) ^"E^*" {!,..., n}, 

iEx i=l i=l 

we conclude that the variance of w does not exceed n^D/m. Therefore, by Cheby- 
shev's inequality, for the output w to satisfy \w — T{X, fJ,)\ < e with probability at 
least 2/3, we can choose m = [3e~^n^-D]. 

As usual, to achieve a higher probability 1—5 of success, we can run the algorithm 
0{ln5~^) times and then find the median of the computed estimates. 

For many measures ^ the bound for m can be essentially improved. In particular, 
we are interested in the case of the logistic measure n with density + + e~'^)~^ . 
To obtain the desired estimate we use a concentration property of the symmetric 
exponential measure v with density e~^^\/2, see Section 4.5 of [LedOl]. 

Let us define 

r7-ln(2-e^) if 7 < 
l7 + ln(2-e-T) if 7 > 

and 

*(c) = (V'(7i)>--- ,V'(7n)) for c= (71,... ,7n). 

Then 1/^(7) has the logistic distribution if 7 has the exponential distribution v. 
Thus we can write 



T{X,fx) = Ew{X;^ici),... ,^{cm)), 
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where vectors (ci, . . . , c^) are sampled from the exponential distribution i/®'^^ in 
]^nm If X is a family of A;-subsets then the Lipschitz coefficient of 

/(Ci, ...,C^)^w{X- *(ci), . . . , ^{Cm)) 

with respect to the £^ metric of M"^"^ does not exceed 2^kjm while the Lipschitz 
coefficient with respect to the £^ metric does not exceed 2/m. Applying Proposition 
4.18 of [LedOl], we conclude that for the output w of Algorithm 2.4 to satisfy 
\w — r(X, \i)\<e with probability at least 2/3, we can choose m = 0(A;e~^). Note 
that the choice of m is independent of the size n of the ground set. 

We observe that it is easy to sample a random weight 7 from the logistic dis- 
tribution provided sampling from the uniform distribution on the interval [0, 1] is 
available (which is the case for many computer packages). Indeed, if ^ is uniformly 
distributed on the interval [0, 1], then 7 = In^— ln(l— ^) has the logistic distribution. 

Our numerical experiments suggest that the choice of m = 0(1) (for example, 
m = 5) is good enough and that in many cases m = 1 suffices. 

(2.6) Counting with multiplicities. Suppose that every element i of the ground 
set {1, . . . ,n} has a positive integer multiplicity qi. Let X be a family of A;-subsets 
of {1, ... , n} and let 

Px{qi,-- - ,qn) = ^ Ylqi 

It may be of interest to compute or approximate px- 

For instance, let A = (a^ ) be a 2A;x2A; symmetric matrix of non-negative integers 
aij. Let us construct an (undirected) graph G on 2k vertices {1, . . . , 2k} where the 
vertices i and j are connected by an edge if and only if aij > 0. We identify the 
edges of G with the set {1, . . . , n}. Let X be the set of all perfect matchings in G 
identified with a family of /c-subsets of {1, . . . ,n}, see Example 2.3. If we assign 
multiplicities aij to the edges of G, then the value of px{(iij) is called the hafnian 
haf ^ of ^, a polynomial of a considerable interest which generalizes permanent. 

Computing (o'l , • • • , Qn) is reduced to counting in the following straightforward 
way. Let N = qi + . . . + and let us view the set {1, . . . , A^} as the multiset 
consisting of gi copies of 1, 52 copies of 2, . . . , g„ copies of n. Let us construct a 
family Y of A;-subsets of {1, . . . , N} as follows: for each A;-subset x E X we construct 
riiex fc-subsets y &Y by replacing every i & x by any of its qi copies. It is clear 
that |y| =px{qi, . . . ,qn). 

To construct Optimization Oracle 2.1 for y, we apply the oracle for X with the 
input c = (71, . . . , 7n), where 7i is the maximum of qi weights assigned to the qi 
copies of i. Moreover, Algorithm 2.4 is easily modified for computing r(y, 11) instead 
of r(X, //). We still work with the underlying family X, but instead of sampling 
weights from the distribution |U, we sample the i-th weight 7^ from the distribution 
/Uq^ of the maximum of qi independent random variables with the distribution ^ 
(note that /Xg. is not symmetric for qi > 1). Thus, if /x is the logistic distribution, 
to sample 7^, we sample ^ from the uniform distribution on [0,1] and let 7^ = 
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— ln(^~^/^* — l). Luckily, for the logistic distribution the required number m of 
calls to Oracle 2.1 does not depend on the size of the ground set, hence we use the 
same number m of calls whether we consider counting with or without multiplicities. 

In [BSOl] we discuss how our approach fits within the general framework of 
the Monte Carlo method. The estimates we get are not nearly as precise as those 
obtained by the Markov chain based Monte Carlo Method (see, for example, [JS97]), 
but supply a non-trivial information and are easily computed for a wide variety of 
problems. Even for the much-studied problem of counting perfect matchings in 
general (non-bipartite) graphs our approach produces new theoretical results. For 
some of the problems, such as counting bases in the intersection of two general 
matroids (see Example 2.2), our estimates seem to be the only ones that can be 
efficiently computed at the moment. If X is a family of A;-subsets of {1, ... , n} 
and \X\ — e^'^ for some A = A(X) then, in polynomial time, we estimate X{X) 
within a constant multiplicative factor as long as \{X) is separated from and 
all sufficiently large \{X) are estimated with an additive error of 1 -|- In A(X), see 
Section 3. Similar estimates hold for counting with multiplicities of Section 2.6. On 
the other hand, the Markov chain approach, if successful, allows one to estimate the 
cardinality \X\ within any prescribed relative error. We note that for truly large 
problems the correct scale is logarithmic because \X\ can be prohibitively large to 
deal with. The Markov chain approach relies on the local structure of X (needed 
for "rapid mixing" ) , whereas our method uses some global structure (the ability to 
optimize on X efficiently). 

(2.7) Asymptotic equivalence of counting and optimization. One can view 
the optimization functional max^^gx X^iex 7* "tropical version" of the polyno- 

mial px^Qii ■ ■ ■ 1 Qn) of Section 2.6: we get the former if we replace "-I-" with "max" 
and product with sum in the latter. Thus our results establish a weak asymptotic 
equivalence of the counting and optimization problems: if we can optimize, we 
can estimate Inpx with a relative error which approaches as k~^\npx grows. 
Vice versa, if we can approximate Inpx, we can optimize (at least approximately): 
choosing qi(t) — 2*"''*, we get 

lini t"Mog2Px(?i(t),--- ,gn(^)) =maxV7i. 

t >-|-oo xeX ^ — ' 

A. Yong [Y03] implemented our algorithms for some counting problems, such as 
estimating the number of forests in a given graph, computing the permanent and 
the hafnian of a given non-negative integer matrix and performed a number of 
numerical experiments. The algorithm produces the upper and lower bounds for 
the logarithm of the cardinality of the family in question, see Section 3. The upper 
bound is attained when the family is "tightly packed" as a subset of the Boolean 
cube whereas the lower bound is attained on sparse families. It appears that there 
is some metric structure inherent to various families of combinatorially defined 
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sets. For example, when we applied our methods to estimate the logarithm of the 
number of spanning trees in a given connected graph, the exact value (which can 
be easily computed by the matrix-tree formula) turns out to be very close to the 
upper bound obtained by our algorithm. Informally, spanning trees appear to be 
"tightly packed". On the other hand, when we estimated the logarithm of the 
number of perfect matchings in a graph, the true value (when we were able to find 
it by other methods) seems to lie close to the middle point between the upper and 
lower bounds. 

3. The Logistic Measure: Results 
Let us choose /xq with density 

^ for 7 e R. 



e7 + e-T + 2 

The cumulative distribution function F of is given by 

FM = for 7 e R. 

The variance of /iq is 7r^/3 [M85]. Our first main result is as follows. 

(3.1) Theorem. 

(1) For every non-empty set X C {0, 1}", we have 

\n\X\ < T{X); 

(2) Let 

sin ttS - 



h{t) = sup [St + — for t> 0. 



Then h(t) is a convex increasing function and for any non-empty family X 
of k-suhsets o/ {1, . . . , n}, we have 

hit) < k-^ In \X\ where t = k-^T{X). 
Prom the expansion 

, sin TTX TT^ 9 /I, ^ 

In = + O(x^) for x^O, 

TTX 6 

we deduce that 

h{t) = ^t^ + 0{t^) for t 
27r 
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(we substitute 5 = (3t/7r^)). From the expansion 

, sinTrfl — x) , o^ 

In ^ = \nx + x + 0(x^) for a; f=i 0, 

TT[1 — X) 

we deduce that 

h{t) > t — liit — 1 as t — > +00 

(we substitute 6=1 — t~^). 

A Maple pfot of h{t) is shown on Figure 1 befow. 




We obtain the foUowing corollary. 

(3.2) Corollary. For any a > 1 there exists (3 = f3{a) > such that for any 
non-empty family X of k- subsets o/{l, ... , n} with \X\ > we have 

I3T{X) < ln|X| < T{X). 

Moreover, 

P{a) — > 1 as a — > +oo. 

Proof. From Part (1) of Theorem 3.1, we have k~^r{X) > In a. Since h{t) is 
convex, we have h{t) > pt for some /3 = P{a) > and all t >liia. The asymptotics 
of P{a) as a — > +oo follows from the asymptotics of h{t) as t — +oo. □ 

Thus, using the logistic distribution allows us to estimate In |X| within a constant 
factor and the approximation factor approaches 1 as In |X| grows. 
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We note that the bound ln|X| < r(X) is sharp. For example, if X is an m- 
dimensional face of the Boolean cube then ln|X| = mln2 and one can show that 
T{X) = mln2 as well. Indeed, because T'{X) is invariant under coordinate permu- 
tations, we may assume that X consists of the points (^i, . . . , 0, . . . ,0), where 

e {0, 1} for z = 1, . . . ,m. The set X can be written as the Minkowski sum 
X — Xi + . . . + Xjn, where Xi consists of the origin and the z-th basis vector e^. 
Hence T{X) = mT{Xi) (cf. Section 4.1) and T{Xi) is computed directly as 



(we substitute = y and then integrate by parts). 

It turns out that the logistic measure is optimal in a well-defined sense. 

(3.3) Theorem. Let M. he the set of all measures /x such that 



for any non-empty family X of k-suhsets of {1, . . . ,n}, any n > 1, and any 1 < 
k <n. 

For a measure fj, & A4 and a number t > 0, let c{t, //) be the infimum ofk~^ In \X\ 
taken over all n > 1, all 1 < k < n, and all non-empty families X of k-subsets 
{1, ... ,n} such that k~^r{X, /j,) > t. Then for all t > 



where fiQ is the logistic distribution. 

(3.4) Discussion. Unless /i is concentrated in 0, for X = {0, 1} we have r(X, fi) = 
cln2 for some c > and hence r{X, ^) = cln |X| if X is a face of the Boolean cube 
{0, 1}"^, cf. Section 4.1. As we are looking for the best measure // in Problem 1.1, it 
is only natural to assume that r(X, ji) > ci In |X| for all X C {0, 1}", which, after 
scaling, becomes r(X, /i) > ln|X|. This explains the definition of M.. 

Let us choose fi E Ai. Then any upper bound for r{X,^) is automatically 
an upper bound for ln|X|. The function c{t,fi) measures the quality of the lower 
bound estimate for In |X| given a lower bound for r{X, /x). 

Incidentally, it follows from our proof that the logistic measure is the measure 
of the smallest variance in M.. 

We prove Theorems 3.1 and 3.3 in Section 6. 



It is convenient to think about families X geometrically, as subsets of the Boolean 
cube {0, 1}"^ C M". Let us fix a symmetric probability measure /U on M with finite 
variance and let /x®"^ be the product measure on M"^. For a finite set X C we 
write 



where c = (71, . . . ,7^) is a random vector sampled from the distribution /x®"^ on 
and (•, •) is the standard scalar product in W^. 




\n\X\ < r(X,//) 



c{t,iJ,) < c(t,/xo), 



4. General Estimates: the Upper Bound 



T{X) = Emax(c,a;), 
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(4.1) Preliminaries. It is easy to check that 

T{X) > T{Y) provided Y cX 

and that 

T{X)=0 if |X| = 1, 

that is, if X is a point (// is symmetric). It follows that T{X) > for any finite 
non-empty subset X C M". Moreover 

T{X + ¥)= r{X) + T{Y) where X + Y ^ {x + y: x e X, y e Y} 

is the Minkowski sum of X and Y. In particular, T{X + y) = r(X) for any set X 
and any point y. We note that 

r(AX) = |A|r(X) where XX = {Xx : xe X} 

is a dilation of X and that T{X) is invariant under the action of the hyperoctahedral 
group, which permutes and changes signs of the coordinates. 

Let S{k, n) be the Hamming sphere of radius k centered at the origin, that is, the 
set of points x — (^i, . . . , ^„) G {0, 1}^ such that + . . . + ^n = k. Combinatorially, 
S{k,n) is the family of all A;-subsets of {!,... ,n}. 

Let F be the cumulative distribution function of jj.. In this section, we prove the 
following main result. 

(4.2) Theorem. For a non-empty set X C {0, 1}" and a number r > 0, let 

G{X,t) = \n\X\- tV{X). 

Let 

r+oo 

^^(a) = ln(l + e-^") -T / {l - F{t)) dt for a G M. 

J a 

(1) For any non-empty set X C {0, l}*^, we have 



G{X,t) < nsup gr{a); 

a>0 



(2) Suppose that 



supgr{a) — gricio) > for some, necessarily finite, gq > 0. 

a>0 

Then there exists a sequence X^ = S{kn, n) C {0, 1}"^ of Hamming spheres 
such that 

G{Xn,T) 

hni = gr{ao). 

n — >+oo n 

Assuming that F is continuous and strictly increasing, we can choose kn = 
an + o{n) for a = 1 — F{ao). 

Before we embark on the proof of Theorem 4.2, we summarize some useful prop- 
erties of ^r(a)- 
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(4.3) Properties of Qt- We observe that 

p+oo r+oo 

griO) = \n2-T {1- F(t)) dt = \n2-T t dF{t). 
Jo Jo 

Furthermore, 

hm gr{a) = 0, 

a — >+oo 

since /i has expectation. If F{t) is continuous then gj- is differentiable and 

In particular, a is a critical point of grici) if and only if a is a solution of the equation 

ra 

= F{a) 

or, in other words, if 

ar = In F(a)-ln(l-F(a)). 
In particular, a = is always a critical point of gr- 

We prove Part (1) of Theorem 4.2 by induction on n, inspired by Talagrand's 
method [T95]. The induction is based on the following simple observation. 

(4.4) Lemma. Suppose that the cumulative distribution function F is continuous. 
For a non-empty set X C {0, 1}^ , n > 1, let 

Xi = |x e {0,1}"-^ : {x,l)eX^ and 

Xo = {xe{o,ir-^ : {x,0)exy 

Then, for any a gM. we have 

r+oo 

r{X) > (1 - F(a))r(Xi) + F(a)r(Xo) + / t dF{t). 

J a 

Proof Let c = (0,7), where c e W^~^, 7 e R, and let 

w{X, c) = max(a;, c) for c e M". 

Clearly, 

w{X,c) > w{Xi,c) +7 and w{X,c) > w{Xo,c). 
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Therefore, 

r{X) = f w{X,c) dix^-^ic) 

= 1 w{X,c) diJi®''{c)+ I w{X,c) diJi®''{c) 

> / (w{Xuc)+^)d^®^{c) + / w(Xo,c) d^®^{c) 

/• /•+00 

= (1 - F{a)) / c) (i//®--i(c) + / 7 ^^^^(7) 

+ F{a) [ w{Xo,c) d//®^-^(c) 

/■+00 

= (1 - F(a))r(Xi) + F(a)r(Xo) + / 7 ^^^"(7), 

J a 

and the proof follows. □ 

(4.5) Lemma. Suppose that the cumulative distribution function F is continuous. 
For a non-empty set X C {0, !}"■ and a number r > let G{X,t) and grid) be 
defined as in Theorem 4-2. Then for any non-empty set X C {0, 1}"', n > 1, there 
exists a non-empty set Y C {0, l}""-*^ such that 

G{X,t) <G{Y,t) + sup gr{a). 

a>0 

Proof. Let us construct Xi and Xq as in Lemma 4.4. We have 

\Xi\^X\X\ and \Xo\ = {1 - X)\X\ for some < A < 1. 

Without loss of generality, we assume that < A < 1/2. Otherwise, we replace X 
hy X' , where 

Clearly, \X\ = \X'\ and by Section 4.1, T{X) = T{X'). 

If A = we choose Y = Xq- Identifying W^~^ with the hyperplane ^^j, = in M"", 
we observe that X — Y and so G{X, r) = G{Y, r). Since by Section 4.3 we have 

sup5r^(a) > 0, 

o>0 

the result follows. 

Thus we assume that < A < 1/2. Let Y G {^o, -^1} be the set with the larger 
value of r), where the ties are broken arbitrarily. We have 

\X\ = hxi\ and |X| = -^|Xo|. 
A 1 — A 
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For any a > 

G{X, r) - In |X| - tT{X) = (l - F{a)) ln\X\+ F{a) \ii\X\ - tT{X) 
= (1 - F(a)) In \Xi\ + F{a) In |Xo| 

+ ((1 - F(a)) In i + F(a) In ^ - Tr(X). 

By Lemma 4.4 we conclude that 

G{X,t) <(1 -F(a))ln|Xi| + F(a)ln|Xo| 

+ ((1 - F{a)) In i + F{a) In 

A i — A 

r+oo 

- (1 - F(a))rr(Xi) - F{a)TT{Xo) - r / t 

J a 

= (1 - F(a))G'(Xi,T) + F{a)G{Xo,T) 

1 1 P+OO 

+ (1 - F(a)) In - + F(a) In - r / t 

^ 1 — A 

1 1 /■+°° 

<G'(y, r) + (1 - F(a)) In - + F(a) In - r / t 

1 — A 

Optimizing in a, we choose 

(4.5.1) a=^ln(^^-^y so a > 0. 

Then 

- = 1 + e™ and = . 

A 1 - A 

Hence 

G{X, r) <G{Y, r) + ln(l + e™) - TaF(a) - r / t 

^ a 

r+oo 

=G{Y, t) + ln(l + e"™) + ra(l - F(a)) + r / t d{l - F{t)) 

J a 

p+oo 

=G{Y, r) + ln(l + e"™) - r / (l - 
=G{Y,T)+gr{a), 

as claimed. □ 

Now we are ready to prove Part (1) of Theorem 4.2. 

Proof of Part (1) of Theorem 4-2. Without loss of generality, we may assume that 
the cumulative distribution function F is continuous. The proof follows by induction 
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on n. For n = 1, there are two possibilities. If |X| = 1 then G{X, r) = (see Section 
4.1) and the result holds since 

swpgr{a) > 0, 

a>0 

see Section 4.3. If |X| = 2 then X = {0, 1} and 

+ 00 



G'(X,t) = ln2 -T / t dF{t) = gr{0), 
Jo 



so the inequality holds as well. 

The induction step follows by Lemma 4.5. □ 

Let S{k, n) be the Hamming sphere of radius k, that is, the set of all A;-subsets of 
{1, . . . , n}. Given weights 7i, . . . , 7n, the maximum weight of a subset x e S{k, n) 
is the sum of the first k largest weights among 71, . . . , 7n- 

The proof of Part (2) of Theorem 4.2 is based on the following lemma. 

(4.6) Lemma. Suppose that the cumulative distribution function F of is strictly 
increasing and continuous. Let us choose < a < 1 and let X^ be the Hamming 
sphere of radius an + o{n) in {0, 1}"^. 
Then 

lim = t dF{t). 



n — >+oo n 



Hl-a) 



Proof. Let 71 , . . . , 7n be independent random variables with the distribution fi and 

let ui-n < U2:n < • • • < Un:n be the Corresponding order statistics, that is, the 
permutation of 71, . . . , 7n in the increasing order. Then 

n 

Jta; m=n—oin-\-o\n) 

Consequently, V{Xn) is the expectation of the last sum. 

The corresponding asymptotics for the order statistics is well known, see, for 
example, [S73]. □ 

Now we are ready to complete the proof of Theorem 4.2. 

Proof of Part (2) of Theorem 4-2. Without loss of generality, we assume that the 
cumulative distribution function F of is continuous and strictly increasing. Let 
us choose a and kn as described, so X„ C {0, 1}"' is the Hamming sphere of radius 

an + o{n) in {0, 1}"'. 

As is known (sec, for example. Theorem 1.4.5 of [Li99]), 

nm =a m h (1 — a) m 

n — >+oo n a 1 — a 

1 
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Moreover, by Lemma 4.6, 



n-^+oo n Ja„ 



' ao 

Hence 

n — >+oo n 

1 1 /' + °° 

On the other hand, 

r+oo 

Qria) =ln(l + e"^") - r / (l - 

J a 



= ln(l + e"™) + Ta(l - F(a)) - r / t dF{t). 

J a 

Since ao is a critical point of gr, we have 

Tao = lnF(ao)-ln(l-F(ao)), 
cf. Section 4.3. Therefore, 

gr{ao) = - InF(ao) + (inF(ao) - ln(l - F(ao))) (l - F{ao))-T ! t dF{t) 

J ao 

1 1 f^'^ 

and the proof foUows. □ 

Some remarks are in order. 
(4-7) Remarks. 

(4.7.1) Optimizing in a in Lemma 4.4, we substitute a — r(Xo) — r(Xi) and 
obtain the inequality 

r+oo 

T{X) > (1 - F(a))r(Xi) + Fia)T(Xo) + / t dF{t) 

J a 

=r(Xo) + / (1 - F{t)) dt. 

Jr(Xo)-r(Xi) 

This inequality is harder to work with than with that of Theorem 4.2 but it some- 
times leads to more delicate estimates, see Section 7. 

(4.7.2) M. Talagrand proved in [T94] that for every non-empty set X of subsets of 
{1, ... , n} there is a "shifted" set X' of subsets of {1, ... , n} such that \X'\ = \X\, 
r(^') ^ r(X), X' is hereditary (that is, \{ x E X' and y d x then y G X') and 
left-hereditary (that is, if a; G X', i E x, j ^ x and i < i then the subset xU{j}\ {i} 
also lies in X'). 
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5. General Estimates: the Lower Bound 

Let us fix a symmetric probabiUty measure fx with the cumulative distribution 
function F. In this section, we prove the following main result. 

(5.1) Theorem. Assume that the moment generating function 

/ + 00 
-oo 

is finite in some neighborhood of 6 — 0. Let 

h{t,fi) =h{t) =svLp(6t- In L{d)) for t>0. 

(1) For any non-empty family X of k- subsets of {1, . . . ,n}, we have 

k-^ In |X| > h{t) for t = k-^T{X); 

(2) For any t > such that F{t) < 1 and for any < e < 0.1 there exist 
k = k{t, e, fj), n = n{k), and a family of k-subsets of the set {1, ... ,n} such 
that 

k-^T{X)>{l-e)t and k-Hii\X\ < h{t) + e. 

Before proving Theorem 5.1, we summarize some properties of L{d) and h{t). 

(5.2) Preliminaries. Let f{6) = In L{6). Thus we assume that f{5) is finite on 
some interval in M, possibly on the whole line. It is known that f{6) is convex 
and continuous on the interval where it is finite, see, for example. Section 5.11 of 
[GSOl]. Since fj, is symmetric, we have /(O) = and from Jensen's inequality we 
conclude that f{5) > for all 5. 

The function h{t) is convex conjugate to f{S). Therefore, h{t) is finite on some 
interval where it is convex, continuous and approaches +oo as t approaches a bound- 
ary point not in the interval. Besides, 

+2 

h{t) = TjT^ + 0{t^) for t ^ 0, 

where D is the variance of \i. In particular, /i(0) = and h{t) is increasing for 
t > 0, see Section 5.11 of [GSOl]. 

Now we are ready to prove Theorem 5.1. 
Proof of Theorem 5.1. 
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Let us prove Part (1). Without loss of generality, we assume that T{X) > 0. Let 
us choose a positive integer m, let N = nm, K = km and let 



y = X X . . . X X c {0, 1} 



JV 



m times 



Let us pick a point y = (xi, . . . , Xm) from Y, where G X for z = 1, . . . , m. Thus 
some K coordinates of y are I's and the rest are O's. Let us endow with the 
product measure /x®^ and let 71 , . . . , be independent random variables with 
the distribution /i. Then, for any t > 

K K ^ 

P|c e : (c,y) > mt} = P{$^7^ > mt^ = P{5^7i > K-^ 

i=l i=l 

By the Large Deviations Inequality (see, for example. Section 5.11 of [GSOl]) 

K 

PjE^i > K-] < exp{-Kh{t/k)}. 

i=l 

Therefore, 

pjceM^: max(c,2/) > mtj < |y|exp{-K/i(t//c)} = exp{-A;/i(t//c)})"'. 
Since a vector c e is an m-tuple c = (ci, . . . , c^) with Cj e and 

max(c, 2/) = V max(Q, a;), 
■^^ — ' xex 

i—l 

the last inequality can be written as 

p|ci,...,c^: — max(ci, g;) > t| < exp{ — /c/i(t/fc)}^ . 
However, by the Law of Large Numbers 

^ m 

— max(ci, x) — > r(-^) in probability 
as m — > +00. Therefore, for any < t < r(X), 

^ m 

P< ci, . . . , Cm : — / max(ci, x) > t> — > 1 as m — > +00. 
I m ^ x€X ) 

t—i 
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Therefore, we must have 



\X\ exp{-kh{t/k)} > 1 for every t < T{X). 

Hence 

k-^ ln\X\> h{t) for every t < k-^T{X), 

and the proof follows by the continuity of h, cf. Section 5.2. 

Let us prove Part (2). Let 71, . . . , 7fc be independent random variables having 
the distribution fi. By the Large Deviations Theorem (see Section 5.11 of [GSOl]), 
if /c = k{e, t, n) is sufficiently large then 



k 

P ' 



> h} > exp{-k{h{t) + e/2)}. 



We make k large enough to ensure, additionally, that (in 3 + lnln(l/e))/A; < e/2. 
Let |X| be the largest integer not exceeding 

31n^exp{/i;(/i(t) +e/2)}, 

so A;~-'^ln|X| < h{t) + e, and let X consist of \X\ pairwise disjoint A;-subsets of 
{1, . . . , n} for a sufficiently large n = n{k). 

Suppose that c = (71, . . . , 7„) is a random vector of independent weights with 
the distribution fx. Since x E X are disjoint, the weights J^iex^i subsets from 
X are independent random variables. Let w{X, c) be the largest weight of a subset 
X e X. We have 

P|c : w{X,c) < kt^ < (^1 -exp{-A;(/i(t) + e/2)}y^' < e/2. 

Similarly (since fj, is symmetric): 

Pjc: w{X,-c) <kt^ <e/2, 

and, therefore, 

P|c: w(X,c) + w(X,-c) <2kt^ <e. 

Since w{X, c)+w{X, — c) is always non-negative, its expectation is at least {l—e)2kt. 
On the other hand, this expectation is 2r(X). Hence we have constructed a family 
X of A;-subsets such that 

k-^r{X) > {1 - e)t and k'^ \ii\X\ < h{t) + e. 

□ 
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(5.3) Remarks. 

(5.3.1) Using the convexity of h{t), one can extend the bound of Part (1) of 
Theorem 5.1 to famihes X of at most /c-element subsets of {1, . . . ,n}. 

(5.3.2) Suppose that the moment generating function L(5, /x) is infinite for aU 5 
except for 5 = 0. Let us choose t > and < e < 0.1. We claim that there exists 
a family X of A;-subsets of {1, . . . , n} such that 

k-^T[X)>{l-e)t and k-^hi\X\<e 

(in other words, we can formally take h{t) = in Part (2) of Theorem 5.1). Let 7 
be a random variable with the distribution ji. For c > 0, let 7c be the truncation 
of 7: 

7, if I7I < c 
0, if I7I > c. 



7. 



c 



Let He be the distribution of 7c. It is not hard to see that r(X, //) > r(X, (Uc) 
(consider T[X) as the expectation of 0.5t(;(X, c) + 0.5w(X, — c), where w(X, c) is 
the maximum weight of a subset a; e X for the vector c = (71, . . . , 7^) of weights). 
Choosing a sufficiently large c brings h{t, jic) arbitrarily close to 0. Then we con- 
struct a set X as in Part (2) of Theorem 5.1. 

(5.3.3) Our proof of Part (2) of Theorem 5.1 seems to require n to be exponen- 
tially large in k. This is not so, since every suitable pair n, k can be rescaled to 
a suitable pair N = nm, K = km for a positive integer m. Let X be a family of 
A;-subsets of {1, ... ,n} constructed in the proof of Part (2) and let 

Y = X X ...X X c {0,1}^. 

^ V ' 

m times 

Then y is a family of K-subsets of {1, . . . , N} and 

K-^T{Y) > {1 - e)t and K-Hii\Y\ < h{t) + e. 

6. The Logistic Measure: Proofs 

In this section, we prove Theorems 3.1 and 3.3. 

Proof of Theorem 3.1. To prove Part (1), let us choose r = 1 in Part (1) of Theorem 
4.2. We have 

/•+00 g-t 

gi{a) = ln(l + e"") - / ^ = for all a. 

J a 1 + 6 

Hence 

ln|X| < V{X) 

as claimed. 
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To prove Part (2), we use Part (1) of Theorem 5.1. The moment generating 
function of the logistic distribution is given by 



r+oo Sx ^ 

= / -x^o = for -1<S<1, 

see [M85]. Hence the formula for h{t) follows. 

It follows from Section 5.2 that h is convex and increasing. □ 

Now we are ready to prove optimality of the logistic distribution. 

Proof of Theorem 3. 3. Let us choose n E A4 and let be the cumulative distribu- 
tion function of /j,. We claim that -F^(t) < 1 for all t G M. To see that, we let r = 1 
in Theorem 4.2. If = 1 then gi{t) > and, by Part (2) of Theorem 4.2, there 

is a set X C {0, 1}"^ with In |X| > r(X), which contradicts the definition of M.. 

Let us assume first that the moment generating function L{S, //) is finite in some 
neighborhood of 5 = 0. Then, by Theorem 5.1, we have c(t, fx) = h{t^ fx) and hence 
we must prove that h{t, fx) < h{t,fj,o), where fxo is the logistic distribution. 

Let 

r+oo 

T{a,fx)= / (l-F^(t)) dt. 

J a 

We can write 

/■ + 00 /■+00 -1 /■ + 00 

/ e'- dF^ix) = - / e'- d{l - F^{x)) - - + / 5e'- {l - F^{x)) 
Jo Jo ^ Jo 

= -+ Se^"" d{-T{x, //)) = -+ ST{0, ii) + / 6^e^''T{x, /i) dx. 
^ Jo ^ Jo 



dx 



Similarly, 

/O /•+00 1 r+oo 

e-^^ dF^,{x) = / e-^^ dFi,{x) = -- 5T{0, fx) + / 5'e-^''T{x, fx) dx. 
-oo Jo ^ Jo 

Therefore, 

L{5, ii) = l + 5' / (e-"^^ + e'^^)T(a;, /i) dx. 
Jo 

Since In \X\ < T{X), by Part (2) of Theorem 4.2 we conclude that 

T(a, ii) > ln(l + e"") = T(a, /lo) for aU a > 0. 

Therefore, L(5, /x) > L(5, (xq) and /u) < fxo) for all t > 0, as claimed. 

Suppose now that the moment generating function L{d,ix) is infinite for S 0. 
Then, as follows from Remark 5.3.2, c(t, //) = for all t > 0, which completes the 
proof. □ 
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7. The Exponential Measure 
Let us choose /j, to be the measure with density 



-e-lTl for 7 e R. 

As we have aheady mentioned, one of the results of [T94] is the estimate 

ln|X| < cT{X) 

for some absolute constant c. In this section, we find the optimal value of c and 
establish some general isoperimetric inequalities which, we believe, are interesting 
in their own right. 

(7.1) Theorem. Let ^ he the measure with density e~^^\ /2 for 7 G M. 

(1) Let X C {0, !}"■ be a non-empty subset of the Boolean cube. Then 

\ii\X\ < (21n2)r(X); 

(2) Let X C {0, 1}"" be a non-empty subset of the Boolean cube such that ^1 + 
• • • + < ^ for every (^1, . . . £ ^ ■ That is, X lies in the Hamming 
ball of radius k and we may interpret X as a family of at most k-element 
subsets of {1, . . . ,n}. Then 

lii\X\ < r(X) +/cln2. 

Before we prove Theorem 7.1, we note that c = 21n2 is the best possible value 
in Part (1). If X is a m-dimensional face of the Boolean cube then In \X\ = mln2 
and we show that r{X) — m/2, so the equality holds. As in Section 3, it suffices 
to check the formula for X = {0, 1}, in which case 

1 f+°° 1 
^ ^ 27o 2 

The inequality of Part (2) is asymptotically sharp: if X is the Hamming sphere of 
radius k = o{n) in {0, 1}"', then 

r(X) =ln|X| -/cln2 + o(/i;) as k — > +00, 

cf. Lemma 4.6. 

As for the lower bound, using Part (1) of Theorem 5.1 one can show that for any 
non-empty family X of /c-subsets of {1, . . . , n}, we have 



k-'^\n\X\ > h{k-^T{X)), 
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where 



h{t) =\/l + t2 + ln(Vl + t2 _ - 21nt + ln2 - 1 
—t — liit — 0{l) for large t. 

Thus the exponential distribution also allows us to estimate In |X| up to a constant 
factor. However, the estimates are not as good as for the logistic distribution. 

Proof of Theorem 7.1. To prove Part (1), we use Part (1) of Theorem 4.2. 
The function grici) is given by 

^^(a) = ln(l + e-^")- ^e"'* for a e M. 

Let us consider the critical points of Qt- 
We have 

, T/e(^-^)'* + e-'*-2\ 

iT^^ )■ 

Since the numerator of the fraction is a linear combination of two exponential 
functions and a constant, it can have at most two real zeros. We observe that a = 
is a zero and that 5'^-(a) < for small a > provided r < 2. 

Hence for r < 2 the function g^- has at most one critical point a > which has 
to be a point of local minimum. 

Therefore 

sup5fT-(a) = max{5fT-(0), O} for all r < 2. 

o>0 

Let us choose r = 21n2. Then QriO) = and we conclude that 

sup^^(a) = 0. 

a>0 

By Part (1) of Theorem 4.2, we conclude that 

ln|X| < tT{X) = (21n2)r(X). 

We prove Part (2) by induction on n. If n = 1, there are two cases. If X 
consists of a single point then T{X) = 0, In |X| =0 and the inequality is satisfied. 
If X = {0, 1} then k = 1 and T{X) = 1/2, hence the inequality holds as well. 

Suppose that n > 1. Clearly, we can assume that k > 0. Without loss of 
generality, we may assume that X is hereditary, see Remark 4.7.2. Let us construct 
sets Xq, Xi C {0, 1}"^"^ as in Lemma 4.4. We note that Xq lies in the Hamming ball 
of radius k and Xi lies in the Hamming ball of radius k — 1. Since X is hereditary, 
Xi a Xq. Therefore, 

|Xo|>|Xi| and r(Xo)>r(Xi). 
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The inequality of Remark 4.7.1 gives us 

T{x) > r(Xo) + ^ exp{r(Xi) - r(Xo)}. 

Let us consider a function 

It is easy to see that for every a the function is increasing in b and that for every b 
it is increasing on the interval a > 6 — In 2. 

Applying the induction hypothesis to Xq and Xi, we conclude 

/(r(Xo), r(Xi)) >/(r(Xo), ln|Xi| - (/c- l)ln2) 

>/(ln|Xo| -A;ln2, ln|Xi| - (/c- l)ln2). 

Therefore, 

T{X) >ln|Xo| - /cln2 + ^ = fln|X| - /cln2) + & -ln\^P^) 

\Xo\ \ J ^\Xo\ \Xq\ / 

= fln|X| - A;ln2) + (t-ln(l + t)) for 

>ln|X| - kln2. 

The proof now follows. □ 

8. An Asymptotic Solution to the Isoperimetric Problem 

In this section, we discuss what sets X^ C {0, 1}" with the smallest ratio 
r(X„, fi) j In may look like. We claim that for any symmetric probability mea- 
sure \x with finite variance and for a sufficiently large n we can choose X„ to be the 
product of at most two Hamming spheres. 

(8.1) Theorem. Let us fix a symmetric probability measure jj, and a number 

< a < ln2. 

Then there exist numbers Pi, Xi, i = 1,2, depending on a and fj, only, such that 

0<A<Ai for 1 = 1,2, 
Ai + As = 1 

and the following holds. 

Let Sl^ be the Hamming sphere of radius (3in + o(n) in the Boolean cube of 
dimension Ajn + o{n), i = 1,2, and let = x be the direct product of the 
spheres considered as a subset of the Boolean cube of dimension n. 
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Then 

In \ Yn\ = an + o{n) 
and for any sequence of sets X„ C {0, such that 

In \Xn\ = an + o(n), 

we have 

T{Yr„fi)<T{Xn,fi) + oin). 

Proof. Let F be the cumulative distribution function of /x. Without loss of gener- 
ality, we assume that F is continuous and strictly increasing. Given jj, and a, let 
us consider the function 

^ ^ J X 

of two variables r > and x > 0. 

By Part (1) of Theorem 4.2, for any r > 0, 

(8.1.1) n~^r(X„) > inf H{t, x) provided In = an + o{n). 

x>0 

We claim that there exists < tq < +00 such that 

inf H{to, x) > inf H{t, x) for all r > 0. 

x>0 x>0 

Indeed, since ct < ln2. 



Also, 



inf H(t, x) — > —00 as r — > + 

a;>0 ^ ^ 



inf H(t, x) — > as T — > +00. 

x>0 



On the other hand, choosing xi > such that 

»+oo 



r+00 

/ (1 - F{t)) dt = S>0 

J Xl 



and Ti such that 

a-ln(l + e"^^'^^) > and T^^\a-ln2\<S 

we observe that 

inf H{ti,x) > 0, 

a;>0 
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which imphes that there exists < tq < +00 maximizing infa;>o H{t, x). 
Our next goal is to show that one can find < xi, X2 < +00 such that 

(8.1.2) H{to, xi) = H{to, X2) = inf H{to, x) 

x>0 



and such that 



(8.1.3) 



+ ln(l + e-^°^0 > and 
"""^^ +ln(l + e-"»"^) <a. 



(it is possible that xi = X2 or that X2 = +00). 

For e in a small neighborhood of 0, we define > as a point such that 

H(to + e, Xe) = inf H{to + e, x) 

x>0 

(possibly x^ = +00). We obtain xi as a limit point of x^ as e — > 0— and X2 as a 
limit point of as e — > 0+. Clearly, (8.1.2) holds and it remains to show that 

(8.1.3) holds as well. 
Indeed, 

d 

H{To,Xi) >H{to + e, Xe) = H{to, Xe) + e — H{T, Xe) 

>H{To,Xi) +e—H{T, Xe) 

for some r between tq and tq + e and i = 1,2. 
Besides, 

§^Hir, .) = 1 + ln(l + e— ) - a) , 

from which we deduce (8.1.3). 

Additionally, from (8.1.2) we deduce that if < a^j < +00, we must have 

d 

—H{To,Xi) = 0, 

that is, 

(8.1.4) __i_ = i_F(^^ for ^ = 1,2, 

which also holds for Xi = and Xi = +00. 

Now we are ready to define Aj and Pi. Namely, we write 

a= Xi( , +ln(l + e~^°^0) where Ai,A2>0 and Ai + A2 = 1, 
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cf. (8.1.3). 

Next, we define Pi and /?2 by 



A = for 1 = 1,2. 

Let 5^ be the Hamming sphere of dimensfon A^n + o{n) and radius PiU + o{n). 
Using Theorem 1.4.5 of [Li99], we obtain 

In IS' I = ln(e^°^* + l) + ln(l + e-^°^0 + o(l) 

groK, _|_ ]^ V ' y V / 

Thus for = X 5^, we have 

In \Yn\ = an + o{n), 

as claimed. 

By Part (2) of Theorem 4.2 and (8.1.4) 

Using (8.1.2) we conclude that for y„ = 5^ x S^, we have 



n n 



\iH{to, xi) + A2-ff(ro, X2) + 0(1). 



= inf if (to, x) + 0(1) 

a;>0 

Hence, by (8.1.1), 

r(r,)<r(x,) + o(n), 

which completes the proof. □ 
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